An improved group teaching optimization algorithm for global function optimization

This paper proposes an improved group teaching optimization algorithm (IGTOA) to improve the convergence speed and accuracy of the group teaching optimization algorithm. It assigns teachers independently for each individual, replacing the original way of sharing the same teacher, increasing the evolutionary direction and expanding the diversity of the population; it dynamically divides the students of the good group and the students of the average group to meet the different needs of convergence speed and population diversity in different evolutionary stages; in the student learning stage, the weak self-learning part is canceled, the mutual learning part is increased, and the population diversity is supplemented; for the average group students, a new sub-space search mode is proposed, and the teacher's teaching method is improved to reduce the diversity in the population evolution process. and propose a population reconstruction mechanism to expand the search range of the current population and ensure population diversity. Finally, the experimental results on the CEC2013 test suite show that IGTOA has clear advantages in convergence speed and accuracy over the other five excellent algorithms.

In many engineering areas, in order to obtain the maximum economic or social benefits, the best solution needs to be sought under certain conditions, such as: In the issue of cargo transport, the planned transport scheme meets both the quantity and loading conditions and minimizes total transport cost; In the process of product production, it is required to reduce the use of manpower and equipment to maximize the profit while meeting the product specifications. Mathematically, the essence of this process is the solution of optimization problems 1 . Due to the increasingly complex optimization problems proposed in the fields of science, engineering, and economics, even if complex mathematical models are built, it is difficult to obtain more desirable results. However, scholars have found that creatures in nature can accomplish complex tasks such as predation, risk avoidance, and evolution by assembling in groups, collaborating with each other, interacting, and other simple actions 2 . It presents a kind of group intelligence from which people take inspiration and design multiple evolutionary algorithms that can better solve optimization problems. Therefore, the swarm intelligence evolutionary algorithm has become the most effective and widely used method at present 3 .
At present, in engineering application fields, such as surface roughness modeling and optimization of tungsten-copper alloys in micro-milling processes 4 ; a decision-making framework for dynamic scheduling of cyber-physical production systems based on digital twins 5 ; evolving fuzzy models of shape memory alloy wire actuators 6 , the representative group intelligent evolution algorithm is mainly as follows: Genetic algorithm (GA), Differential Evolution (DE) algorithm , Particle Swarm Optimization (PSO) algorithm, Whale Optimization algorithm (WOA), Grey Wolf Optimizer (GWO), Artificial Bee Colony Algorithm (ABC) algorithm. Among them, the GA 7 is a biological evolution process that simulates the natural selection and genetic mechanism of Darwin's theory of biological evolution, chromosomes are the main carrier of genetic material, that is, a collection of multiple genes, constitutes the genetic operation of genetic algorithm through selection, crossover and mutation. The DE 8 is proposed based on the evolutionary idea of genetic algorithm, algorithm variant vectors are generated by the parent difference vector and crossed with the parent individual vector to generate new individual vectors, selected directly with its parent individual. The PSO algorithm 9 is inspired by the social behavior of flocks or fish, each individual in a particle population is called a particle, representing a position or possible candidate solution in a multidimensional search space, updating positions by cooperating with each other until the optimal solution is found. The WOA 10 is based on the behavior of whale prey, the location of each whale represents a viable solution, and during each generation of swimming, the whales randomly choose driving or rounding up to gradually approach their prey. The GWO 11 is a group intelligent optimization algorithm for www.nature.com/scientificreports/ population reconstruction mechanism is proposed to increase the possibility of population jumping out of local optimum. Tested on the CEC2013 test set, the results show that IGTOA has certain advantages in convergence speed, convergence accuracy and stability compared with the other five optimization algorithms.The remaining structure of this article is as follows: The "GTOA" Section introduces how the original algorithm GTOA works. In the "IGTOA" Section, the improved algorithm IGATOA is proposed, and its overall structure and improvement methods are elucidated. The "Experiment and analysis" Section will improve the algorithm and the original algorithm and other excellent algorithms, based on the CEC2013 test function simulation experiments, and obtain results. Finally, "Conclusion" Section provides a concluding overview.

GTOA
In 2020, Zhang Yiying et al. proposed the GTOA through the simulation group teaching mechanism to solve the continuous function optimization problem. In this algorithm, the decision variable is equal to the discipline, and the individual is equal to the student, that is, the individual student is composed of multiple disciplines, and the fitness value is equal to the knowledge level of the student. For solving the minimization problem, the smaller the fitness value of the individual, the higher the knowledge level of the student. Group the students according to their knowledge level, combine the characteristics of the group, adopt different learning methods, and constantly increase the amount of knowledge of the students in the process of evolution. The pseudo-code of the GTOA is shown in Algorithm 1, and its key operations are described below.
Population initialization. Assume that the population size is N, and the dimension of the problem to be optimized is D, the i-th individual in the initial population X 0 = [X 0 1 , X 0 2 , ... , X 0 N ] T can be described as www.nature.com/scientificreports/ X o i = [x 0 i,1 , x 0 i,2 , ..., x 0 i,D ](i = 1,2,…,N; j = 1,2,…,D), its j-th dimension x i,j is randomly generated according to Eq. (1): Among them, Um and Lm are the upper and lower limits of the search range of the optimization problem, respectively, and rand(0,1) is a random number between 0 and 1.
Teacher assignment. For the current population X t , a teacher is determined according to Eq. (2) to teach students in each iteration.
Among them, X t 1 , X t 2 and X t 3 are respectively the three students with the best fitness value, and t is the current number of iterations.
Update good student group. The individual update of the good students group needs to go through the teacher teaching phase and the student learning phase in turn, as follows.
Teacher teaching phase. Generally, students in good group have strong ability to accept knowledge, and teachers pay attention to improving the average knowledge of the whole group when teaching. In view of this, the GTOA designed the teacher teaching method as shown in Eq. (3).
Among them,X t+1 GT i represents the individual update of student X t Gi after the teacher's teaching; F is the teaching factor, with a random value of 1 or 2; a, b, c are random numbers in [0,1], and b + c = 1; M t is the average subject knowledge of students in the good group in the t iteration process, as shown in Eq. (4).
If the knowledge level of students is improved after teaching, the subject knowledge of students should be updated; otherwise, the original students should remain unchanged. The details are shown in Eq. (5).
Student learning phase. According to Formula (6), students learn from each other within the group to acquire new knowledge of each subject, and decide whether to update the current individual students according to the method described in Eq. (7). Among them, e and g are two random numbers within [0,1]; X t+1 GS i is the individual updated by X t+1 GT i after learning through the student phase during the t + 1 iteration; X t+1 GT j is another individual student randomly selected in this group, and j ≠ i.
Update average student group. Similar to the good group, the individual renewal of the students in the average group also needs to go through the teaching phase and the learning phase successively. Among them, the learning phase of the students in the average group is exactly the same as that of the good group. And according to the differences in the knowledge level of the two groups, the teaching phase of the teachers is different, which is as follows.
In view of the relatively poor knowledge level of students in the average group, teachers are more inclined to improve the knowledge level of individual students in the learning process. The GTOA has developed a teaching plan for the students in the average group, such as Eq. (8). Similar to the teaching of good group teachers, after the teaching, they should also judge whether to update their existing subject knowledge according to Eq. (5).

IGTOA
A large number of experiments show that for more complex function problems, similar to other swarm intelligence evolutionary algorithms, GTOA also has shortcomings such as slow convergence speed and easy to fall into local optimum. This paper deeply analyzes the reasons for the above problems, and proposes an improved group teaching optimization algorithm (IGTOA), the flow chart of which is shown in Fig. 1. Teacher assignment. As described in "Teacher assignment" Section in the teacher assignment stage of the GTOA, from the optimal individual and the average of the three better individuals, the winner is selected as the teacher of all students, and all students in the subsequent teaching stage of the good group and the teaching phase of the average group are selected. Students learn only from that teacher. Obviously, such a single-teacher learning mode will make individuals approach to it quickly, resulting in a single direction of individual explora- www.nature.com/scientificreports/ tion, rapid decline of population diversity, and easy to fall into local optimal. In order to solve the above problems, in IGTOA, all students independently select teachers for learning in the following way, as shown in Eq. (9).
Among them, T t represents the teacher assigned to the students during the teacher teaching stage; P _teacher represents teacher assignment probability. Generally, P _teacher = 0.5 can achieve better results.
By the Eq. (9) can be seen, each individual is independent of choosing the best individual and one of the three is the center of the optimal individual as a teacher, and learn from it, evolutionary direction is no longer a single individual, can better maintain the population diversity, and the best individual and all three is the center of the optimal individual carries the better the evolution of the information, will not be too much lower convergence speed. In short, this new allocation method of independent teachers can balance the convergence rate and population diversity.
Adaptive grouping. In GTOA, according to the fitness of individuals, half of the individuals with better fitness value in the whole population are divided into good group, and the rest are divided into average group. Through the overall analysis of GTOA, it can be found that the evolution of average students is relatively slow, and the main function is to provide evolutionary information for good students to explore and develop new positions, so that they can quickly approach the global optimal position. In short, the average group focused on maintaining population diversity, while the good group was mainly responsible for exploration and search.
Generally, different evolutionary stages have different requirements for algorithm performance: In the early stage of iteration, the fitness value gap between individuals is large, and the population diversity is good. Usually, it is hoped that the algorithm will quickly converge to the region where the optimal solution is located. As the evolution progresses, the fitness value gap between individuals is decreasing. , the individuals become more and more similar, and the population diversity gradually deteriorates. It is expected that the algorithm can increase the population diversity in order to have the ability to jump out of the local optimum. In order to better meet the needs of different evolutionary stages of the algorithm, in the early stage of evolution, the scale of the average group should be appropriately reduced and the scale of the good group should be expanded; and vice versa. Based on the above ideas, this paper proposes a dynamic allocation method of the number of good group and average group students as shown in Eq. (10), and the allocation process is shown in Fig. 2. Among them, N A t and N G t represent the number of students in the average group and the good group in the t-th iteration respectively; P _group is the grouping ratio, generally, P _group = 0.1 can achieve better results; ⌊•⌋ represents rounding; t is the current iteration number of times; max_t maximum number of iterations.
As can be seen from Fig. 2, in the process of 1500 iterations, the number of students in the good group was significantly higher than that in the average group in the early stage of evolution. With the increase of iterations, the number of students in the good group gradually decreased, in line with the different needs of good group size and average group size in different evolutionary stages. Improvement of update method for good group in student learning phase. GTOA divides the population into good and average groups according to the merits of the fitness value, and each group goes www.nature.com/scientificreports/ through the teacher teaching phase and the student learning phase accordingly, and the two groups share the same student learning strategy. Unlike the teacher phase, the student learning phase does not learn from good teachers, but from other individuals in the group and themselves. Therefore, the student learning phase is mainly responsible for maintaining the population diversity within the group. An in-depth analysis of the way in which individuals are updated at the student learning stage as shown in Eq. (6) shows that the new individual is actually constituted by "Original individual + Mutual learning + Self-learning". Among them, the mutual learning part randomly selects other individuals in the group to learn, which can provide population diversity to a certain extent; the self-learning part is that the current individual learns from the individual who has not undergone the teacher stage, if the individual is not retained after going through the teacher stage, the current individual is exactly the same as the individual who experienced the teacher stage before. Obviously, self-learning is completely meaningless, and if the individual is retained after going through the teacher stage, the current individual is better than the individual before the teacher stage, and it is difficult to learn from it to produce new individuals who are better than themselves, then the self-learning stage will still not play a role. In short, the self-learning component of GTOA is extremely weak. In order to further enhance its population diversity, the self-learning part is abolished, learning from other individuals is added, and the new student learning stage individual renewal method is proposed as shown in Eq. (11).
where X t+1 GS j , X t+1 GT k are two different individuals randomly selected in the good group, i ≠ j ≠ k, respectively, and the definitions of parameters a and b are shown in Eq. (12).
where, sgn(•) is a symbolic function. Improvement of update method for average student group. As described in "Adaptive grouping" Section, the good group in GTOA is mainly responsible for exploration and search, while the average group focuses on maintaining population diversity. Similar to the updating method of students in the good group, students in the average group also conduct a complete search in the D-dimensional search space. Such a wide range of communication in the search space is very likely to make the population quickly close to several superior individuals and gather in a certain area or several regions, resulting in a serious loss of population diversity. If the average group of students does not conduct a complete large-scale search in the D-dimensional search space, but only conducts a small-scale search in some dimensions, it is easy to overcome the above shortcomings.
Based on the above analysis, average group of students in teachers' teaching and students' learning phase are small range subspace search, among them, the average group of students to study with good group of students learning phase in the same way, and the new design of ordinary teachers teaching phase and sub-space model way choice of dimension to search in the specific as follows.
Dimension selection in subspace patterns. In order to maintain the diversity of the population as much as possible, in the subspace search mode we designed for the average group of students to update, the number of dimensions to be updated by each individual and the specific dimensions are randomly generated, as follows: First, for each individual, a random integer k is randomly generated in [1,D], which is the total number of dimensions that the individual needs to update; Then, k random integers are randomly generated in the dimension space [1,D], and the subsequent subspace search will be performed in its corresponding dimension.
To further understand the dimension selection in the above subspace pattern, a concrete example is shown in Fig. 3. Given that the dimension of the problem to be optimized is 30, for example, for individual X2, the number of randomly generated dimensions to be updated is 5, and 5 random integers are randomly generated within [1,30], which are {4,12,5,29,17} respectively, indicating that individual X2 will only search on the dimension {4,12,5,29,17} when conducting subspace search, the other dimensions don't change.
Improvement of teacher teaching phase. It can be seen from Algorithm 1 that the students in the good group and the average group in GTOA only rely on the students in this group to update their subject knowledge. When the next iteration is regrouped, the good group and the average group can exchange information with each other. However, since the students in the good group are better than the students in the average group, after the knowledge update, the students in the good group are basically better than the students in the average group. Therefore, even after regrouping, only a very small number of students from the average group entered the good group. Obviously, students in the good group only absorbed a very small amount of the evolutionary information of the average group, and did not really get the diversity supplement. Basically, they still relied on their original evolutionary information to evolve, which was easy to quickly converge to a local optimum. In order to make the good group get the full diversity supplement, it is necessary to further improve the subject knowledge of the students in the average group and increase the opportunity for them to enter the good group. In view of this, the teaching methods of average group teachers are improved as follows, as shown in Eq. (13). represents the teachers assigned according to Eq. (10), and one teacher is independently selected from each dimension; X t+1 GT m,j represents an individual selected randomly from X t+1 GT , and likewise, an individual selected independently from each dimension; F 1 is the random number between [1,2], and a is the random number between [0,1].
A comparison of Eqs. (8) and (13) can be found that: First, in the GTOA, the learning objects in the average group are all teachers. In this section, all the students of the teachers in the good group are also listed as the learning objects. Because the good group students are the result of learning from teachers, the gap between the average group and the good group is further shortened, and the possibility of the average group gene flowing into the good group is increased. Second, compared with only learning from teachers themselves, the new teaching method of teachers has more combinations of learning objects, which greatly improves the diversity of students in the average group. Although the evolutionary information of some outstanding group students will be mixed into the average group students, the evolutionary genes flowing in different dimensions come from different outstanding students, and the genes of the average group students are completely preserved in the dimension without subspace search, so that there is a big difference between the evolutionary information of the average group students and the evolutionary information of the good group students. Therefore, when the students of the average group flow into the good group, they can be supplemented with a certain population diversity. To sum up, the new teaching method of average group teachers proposed in this section has certain advantages.
In order to further understand the subspace search mode of teacher teaching phase and student learning phase in the average group, a specific example is given in Fig. 4. Given that the dimension of optimization problem is 10 and the number of individuals is 5. In the teacher teaching phase, the number of dimensions to be updated in the randomly generated subspace of individual XA2 is 3, and 3 dimensions are randomly selected for subspace search, including {2, 5, 9}, and other dimensions remain unchanged. For dimension 2, random teacher T1 and good group student XG2 learn from their 2-nd dimension according to formula (13); for dimension 5, teacher T2 and the 5th dimension of good group students XG3 were randomly selected for learning; and for dimension 9, the teacher T1 and the 9-th dimension of good group XG4 were randomly selected for learning. Assuming that the newly generated individual is superior to the original, the original individual will update its knowledge of each subject, otherwise unchanged.
It should be noted here that the students in the average group adopt the same learning method as the good group students in the learning stage, as shown in Eq. (11). The difference is that they do not use the full space search mode, but use the subspace search mode.
Population reconstruction mechanism. A large number of experimental studies have shown that, similar to other swarm intelligence evolutionary algorithms, for very complex optimization problems, with the progress of evolution, GTOA also has the defects of slow evolution and difficulty in obtaining the global optimal solution. In order to solve this problem, this section proposes a new population reconstruction mechanism, which mainly includes the starting conditions of the population reconstruction mechanism and the population reconstruction strategy. The details are as follows.  ... www.nature.com/scientificreports/ Initiating conditions of population reconstruction mechanism. As we all know, swarm intelligence evolutionary algorithm will show the following obvious characteristics when it evolves slowly: in the process of successive iterations, the optimal value obtained by the population does not change. Therefore, this excerpt uses the change of global optimal value as the starting condition of population reconstruction mechanism. Specific methods are as follows: Initialization parameters change_flag = 1 and flag, where change_flag is used to record the number of times that the global optimal value does not change continuously, flag is the threshold that determines the local optimal value; Then, the Euclidean distance of the globally optimal individual in two consecutive iterations is calculated according to Eq. (14), if move = 0, change_ flag = change_ flag + 1, otherwise, change_ flag = 1. When change_ flag = flag, the population reconstruction policy is started and change_ flag is set to 1.
Among them, bestX t and bestX t−1 represent the optimal individuals in the t-th and (t-1)-th iterations, respectively.
Population reconstruction strategy. Experimental results show that for complex optimization problems, the improved algorithm proposed in this paper has a good ability to maintain population diversity. Even if the phenomenon of slow evolution occurs, the individual differences in the current population are relatively obvious, and the clustering and high similarity of individuals are not presented. Through in-depth analysis of the overall optimization process of GTOA, it is not difficult to find the essential reasons for the above phenomenon as follows: At the beginning of its evolution, GTOA started from a very limited number of individuals. Under the guidance of excellent teachers, it drove each individual to move from the area to the better area, and gradually narrowed the search range, so that outstanding individuals could use the help of a smaller range. Other individuals conduct further in-depth and refined exploration, thereby stimulating better evolutionary information to improve teachers. However, for very complex optimization problems, there are often multiple local optimal solutions, and the global optimal solution is hidden in a narrow region. In the huge search space, each individual will be drawn to several local optimal regions with a high probability, because the local optimal values are relatively similar, it is difficult for each individual to jump out of the local optimal region. Although good population diversity can be maintained, the search area cannot be reduced, thus it is difficult to provide motivation for teachers to further fine exploration. To sum up, in order to force teachers to have the motivation to search carefully, the search scope must be appropriately narrowed and other evolutionary information must be introduced.
Generally, GTOA in slow evolution of several iterations, the better individual has acquired when the area before the local optimum, obviously, compared with extensive search in large search space, surrounded by where l is the scaling coefficient of the lens, typically, l = 10.
where k is a random integer of [1, N1]. The above population reconstruction methods have the following advantages: First, in several iterations, although the optimal value has not changed, sufficient communication has been carried out between individuals, and effective evolutionary information in poor individuals has been contained in excellent individuals. Therefore, retaining some excellent individuals and removing half of the poor individuals can basically represent the current evolutionary information and will not affect the exploration ability of the population. Second, the reverse learning is carried out on the individuals in the sub-population pop2, which not only further effectively narrows the search range, but also introduces other evolutionary information, which further provides the impetus for the evolution of outstanding individuals. Third, the individual parts and even all dimensions of the population pop3' are randomly selected from the best part of the individuals, and obviously, they all belong to the partially sampled individuals in the region formed by the best part of the individuals. It not only effectively reduces the search area, but also because they are the recombination of the various dimensions of the better individuals, although they contain part of the evolutionary information of the better individuals, they are quite different from them, which supplements the population diversity to a certain extent, so that the algorithm has the power to further fine-tune the search. Figure 6 shows the comparison before and after population reconstruction. Assuming that the number of individuals is 50, the problem to be optimized is a sphere function with a dimension of 2. It can be seen that the search range is significantly reduced after population reconstruction, and it is closer to the global optimum.
The complexity analysis of the algorithm. The population size of IGTOA algorithm is N; the number of students in excellent and ordinary groups is N good and N average respectively; and the problem dimension is D. As shown in Fig. 1, the IGTOA algorithm includes the following four main steps: good group teacher stage (T _gt ), good group student stage (T _gs ), general group teacher stage (T _at ), general group student stage (T _as ). The time overhead of the IGTOA algorithm also mainly comes from these four operations. At each generation of the IGTOA algorithm runs, the worst-time complexity of the above four operations is analyzed as follows: The good group teacher stage (T _gt ) needs to calculate N good × D times at most formula (3), so its corresponding worst time complexity is O(N good × D); the good group student stage (T _gs ) needs to calculate N good × D times at most formula (11), so its corresponding worst time complexity is O(N good × D); the average group teacher stage

Experiment and analysis
In this part, we test the performance of the proposed IGTOA algorithm through four experiments: the first is the parameter sensitivity analysis; the second is the effectiveness of each improvement strategy; and the third is the performance comparison with other excellent intelligent optimization algorithms. The fourth is to compare the results of each algorithm in engineering applications.
"Effect of the parameters on the algorithm", "Proof of the effectiveness of the improvement measures", "Compared with other excellent algorithms" Sections is tested on a CEC2013 test set containing 28 functions. According to the characteristics of these functions, they can be divided into three groups: the first group is the unimodal function F1-F5, the second group is the multimodal function F6-F20, and the third group is the combination function F21-F28. Detailed information about this test set can be found in the literature 32 . Meanwhile, the algorithms are implemented in Matlab 2021a. All the tests are conducted on a personal computer equipped with a core i7-11800H CPU (2.30 GHz) and a 16.0-GB memory.
Effect of the parameters on the algorithm. The proposed algorithm IGTOA adds to the original GTOA algorithm with the following parameters: including change_flag, Pg, l, N1 and N2. When examining the influence of one of the above parameters on the performance of the IGTOA algorithm, the parameter is set to 5 sets of values, and the other parameters remain unchanged. In all experiments, the population size N = 50, the problem dimension D = 30, the maximum function evaluation number MaxFEs = 5000*D, and the other parameters were set as follows: (1) When examining the effect of change_flag on the performance of the IGTOA algorithm, change_flag is set to 10, 30, 50, 70 and 100, respectively. Other parameters are set as follows: All of the above experiments were run independently 30 times on the CEC2013 test set, and the average value of the optimal value obtained from each independent run when the same number of preset function evaluation times was reached was counted. The specific data are shown in Tables 1 and 2. In Tables 1 and 2, the parameters that perform best on each function are shaded, and the number of functions is calculated on the last line. In order to further compare the performance of the algorithm, the parameters of the data in Tables 1 and 2 are shown as in Fig. 7, in which the height of the bar graph represents the size of the average rank of each algorithm, the higher the bar graph, the higher the average rank and the higher the rank, the overall performance of the algorithm of the parameter at the value.
According to the data in Table 1, when change_flag = 30, the relatively best results are achieved on the 22 functions; when change_flag is 10 or 70, it works relatively best on 16 functions; and when the change_flag = 50 www.nature.com/scientificreports/ or 100, the relatively best results are achieved on 14 and 13 functions, respectively. As can also be seen from the results in Fig. 7a, IGTOA algorithm performance is optimal when change_flag = 30, when change_flag = 10,50 and 70, IGTOA algorithm performance is little different, but it is inferior to IGTOA algorithm performance when change_flag = 30, while when change_flag = 100, IGTOA algorithm performance is not satisfactory. In "Conclusion" Section, the IGTOA algorithm is more sensitive to the parameter change_flag, and it performs the best when change_flag = 30. Analyzing other data in Tables 1 and 2 and Fig. 7 in the same way, we can find that the IGTOA algorithm is sensitive to both parameters l and N1 and N2, with slightly decreased sensitivity to parameter Pg, and the IGTOA algorithm performs best when Pg = 0.1, l = 10 and N1 = N2 = 0.2*N. To sum up. when the parameters change_flag, Pg, l, N1, N2 are set to 30, 0.1, 10, and 0.2 N,0.2 N, respectively, IGTOA has a good optimization effect. If you want to obtain a better effect for a certain actual optimization problem, the above parameters can also be adjusted several times.

Proof of the effectiveness of the improvement measures.
To verify the effectiveness of each improvement measure in Parts 3.1-3.3, a corresponding improvement strategy was removed in IGTOA, five new improvement algorithms were formed, including the improved algorithm for removing the teacher allocation strategy of "Teacher assignment" Section in IGTOA, the improved algorithm for removing the adaptive grouping strategy of "Adaptive grouping" Section in IGTOA, the improved algorithm for removing the good group improvement strategy of "Improvement of update method for good group in student learning phase" Section in IGTOA, the improved algorithm for removing the common group improvement strategy of "Improvement of update method for average student group" Section in IGTOA and improved algorithms for the population reconstruction strategy of removing "Improvement of teacher teaching phase" Section in IGTOA. For simplicity, the above five new algorithms are called IGTOA1, IGTOA2, IGTOA3, IGTOA4 and IGTOA5, respectively. The above five improved algorithms and IGTOA were compared on the CEC2013 test set. To ensure the fairness of the comparison, the parameters of each algorithm are set exactly the same, as follows: population size N = 50, problem dimension D = 30, maximum evaluation number MaxFEs = 5000*D, change_flag = 30, Pg = 0.1, l = 10, N1 = N2 = 0.2*N. The mean and variance of the optimal values obtained by running the algorithms www.nature.com/scientificreports/ independently 30 times on each function are counted, and the specific results are shown in Table 3. Among them, the data from outside and in parentheses represent the mean and standard deviation of the optimal values obtained in 30 independent experiments, respectively. The bold value indicates that the performance of the other improved algorithms are significantly worse than IGTOA on the corresponding functions, counting the number of functions for which the improved algorithm is significantly worse than IGTOA in the penultimate row, and giving the average rank and the ranking results of IGTOA and the other five improved algorithms for Friedman test in the last two lines. As can be seen from Table 3, the functions of the five improved algorithms performing perform IGTOA performance are 14, 10, 15, 14 and 11 respectively. This shows that the five improvements proposed in "Teacher assignment", "Adaptive grouping", "Improvement of update method for good group in student learning phase", "Improvement of update method for average student group" , "Improvement of teacher teaching phase" Sections have some effectiveness. Furthermore, the rank of each algorithm obtained by Friedman detection is visible, with the smallest rank of the corresponding improvement algorithm after removing the good group improvement strategy of "Improvement of update method for good group in student learning phase" Section compared to IGTOA. This shows that among the five improvements, the improvement measure in "Improvement of update method for good group in student learning phase" Section has the greatest impact on the performance of the IGTOA algorithm, while several other improvement algorithms have little difference on the performance of IGTOA algorithm.
Compared with other excellent algorithms. In order to fully investigate the performance of IGTOA algorithm, the algorithm, basic GTOA and four recently proposed representative optimization algorithms are analyzed in four aspects of convergence accuracy, convergence speed, stability and running time. Algorithms for comparison include IATTP(2021) 16 ; MSMPSO(2018) 12 ; ADN-RSN-PSO(2017) 11 ; ESCA(2022) 17 . To ensure the fairness of the algorithm, the number of individuals in the population is N = 50, the test problem dimen- www.nature.com/scientificreports/ sions D = 30, 100, the maximum function evaluation times MaxFEs = 5000*D, and the remaining parameters are shown in Table 4.
Comparative experiment on algorithm convergence. Tables 5 and 6 respectively show the experimental results of each algorithm for solving the 30-and 100-dimensional function problems on the CEC2013 test set, with in-and out-parenthesis values being the standard deviation and mean of the optimal values obtained in 30 independent experiments. Among them, the bold data indicates that the corresponding algorithm has obtained the best solution effect on the test function, and the last line counts the number of functions for obtaining the best performance of each algorithm. To further compare the performance differences of the above algorithms, a Friedman test with a significant level of 0.05 was performed on the above data, and the specific results are shown in Table 7. Among them, the size of the p-value reflects the difference between the two algorithms in the current function. When the p-value is less than 0.05, it indicates that the IGTOA and the corresponding comparison algorithm have obvious differences in the current function, while otherwise, there is no significant difference between the two algorithms. In addition, the "+/=/−" in the last row indicates the number of functions that the IGTOA algorithm is significantly better than, not significantly different from, and significantly inferior to the corresponding comparison algorithm, respectively. As can be seen from the data in Tables 5, 6   www.nature.com/scientificreports/  show that IGTOA has obvious advantages in convergence accuracy over the remaining five algorithms. In addition, as the dimension of the optimization problem increases, the advantages of the IGTOA algorithm are also greater. To further compare the comprehensive performance of each algorithm on all functions, Table 8 presents the results of Friedman detection. As can be seen from the Friedman test results in Table 8, the rank of IGTOA is significantly lower than the other five methods, indicating that the IGTOA algorithm performs best in terms of convergence accuracy.For 30-dimensional function optimization problems, the comprehensive performance of each algorithm is IGTOA > GTOA > IATTP > ADN-RSN-PSO > MSMPSO > ESCA; for 100-dimensional function optimization problems, the comprehensive performance of each algorithm is IGTOA > IATTP > GTOA > ADN-RSN-PSO > MSMPSO > ESCA.
Comparative test of the convergence rate of the algorithm. In order to compare the convergence rate of the algorithm more intuitively, Fig. 8 gives the iterative process curve where each algorithm is run randomly once when the test function dimension is 30.The horizontal and vertical coordinates represent the logarithm of the function evaluation times and the fitness values, respectively.Parameter settings for each algorithm are performed as in Table 4.
As can be seen in Fig. 8. For the functions F1, F3, F5, F7, F9, F11, F12, F13, and F20, the IGTOA all converge to the theoretical optimal results; the ESCA converges to the theoretical optimum on F3, F9, F11, and F13; the IATTP algorithm converges to the theoretical optimum on F3, F7, F9, F11, F12, and F13; the GTOA converges to the theoretical optimum on the F3, F7, F9, F11, and F13; the ADN-RSN-PSO converges to the theoretical optimum only on the F11; while MSMPSO does not obtain the theoretical optimal results on any function. Compared with IGTOA, GTOA and IATTP showed faster convergence on F9, F11 and F13, while ESCA converged only faster on F11, while other algorithms converged slower on the remaining functions, including F1, F3, F5, F7, F12 and F20. For the remaining 19 functions, each algorithm converged to the local optima, including F2, F4, F6, F8, F10, F14-F19 and F21-F28. For F2, F14, F19 and F21, IGTOA only converged slightly slower than GTOA in the early evolution, but all faster than the other four algorithms, especially in the late evolution, IGTOA showed faster convergence than the other five algorithms. For F4, IGTOA converges slower than IATTP, but faster than the other 4 contrast algorithms. For F15, F23 and F26, the IGTOA showed the fastest convergence compared with the other five algorithms, whereas the IGTOA decreased.Later in evolution, IGTOA converged only faster on F15 than IATTP, slower on F23 than GTOA and ADN-RSN-PSO, and only slightly slower than MSMPSO on F26. For F16, IGTOA converges only converged slightly slower than MSMPSO in early evolution, but by later evolution, IGTOA showed the fastest convergence. For F24, IGTOA showed the fastest convergence rate in the early evolution, slowing down as evolution progressed and being comparable to that of IATTP and GTOA. For F27, IGTOA converged slightly slower than MSMPSO and IATTP in the early evolution, the convergence of each algorithm decreased, but ADN-RSN-PSO decreased more slowly, and by the later evolution ADN-RSN-PSO, MSMPSO and IATTP all converged faster than IGTOA. But for other functions, including F6, F10, F17, F18, F22, F25, and F28, IGTOA showed the fastest convergence rate compared to the other five evolutionary algorithms. In Conclusion Section, IGTOA has some advantages in convergence speed over the other five algorithms.
Comparative test of the algorithm stability. To intuitively compare the stability of each algorithm, we draw the box plot of the optimal results obtained from 30 independent runs of each algorithm.Limited to space, this section selects only nine different types of functions for comparison, including: F1, F2 and F5 in uni-modal functions; F6, F14 and F16 in multi-modal functions; and F22, F25 and F28 in combined functions. As shown in Fig. 9. www.nature.com/scientificreports/ Figure 9 follows from the fact that for the uni-modal functions F2 and F5 and the multi-modal functions F22 and F14, IGTOA is more stable than the other five algorithms.For the uni-modal function F1, the stability of IGTOA and IATTP was flat, and both significantly outperformed the other four algorithms, including GTOA, MSMPSO, ADN-RSN-PSO, and ESCA.For multi-modal function F16, IGTOA is only slightly less stable than GTOA; for combined function F25, IGTOA is slightly less stable than ESCA, but its solution accuracy is  Table 9. The dimension of the test function is 30, the maximum number of evaluation is Max_FEs = 5000*D, and the other parameter settings are shown in Table 4.
As seen from Table 9, for the unimodal function F1-F20, I GTOA runs for slightly longer times compared to GTOA, MSMPSO, ADN-RSN-PSO, and ESCA. For the combined functions F21-F28, the running time of each algorithm is not very different. However, the running time of the algorithms is not much different, which means that the time complexity of I GTOA is slightly higher compared with G T O A and other contrast algorithms. This is due to the multiple improvement strategies employed by IGTOA, requiring more manipulation when looking for better individuals. Combined with the convergence rate, with the same convergence accuracy, IGTOA does not increase compared with the other algorithms.   www.nature.com/scientificreports/ Comparison of the engineering application effect. In order to further compare the effects of IGTOA algorithm and other comparison algorithms in practical application, this section uses each algorithm to handle the cooperative beam forming optimization problem. The cooperative beam forming optimization problem is a typical problem in the antenna array. By optimizing the amplitude and phase of the emission signal weight of each cooperative node, the peak side valve level PSL minimization as shown in formula (17) is realized. www.nature.com/scientificreports/ where, AF(θ, w) represents the array factor, as shown in formula (18). φ is the main beam direction. θ SL is the direction corresponding to the peak point in the range θ ∈ [−π, φ) ∪ (φ, π ] beam chart except for the main lobe peak point, is called the lateral lobe direction. The denominator AF(φ,w) is the main beam power and the molecule max |AF(θ SL , w)| is the maximum beam power in the side flap.
where, w k is the complex number weight coefficient of the signal emitted by the k-th cooperative node, as shown in formula (19).  www.nature.com/scientificreports/  www.nature.com/scientificreports/ where, ξ k and α k are the amplitude and initial phase of the emission signal weights of the k-th cooperative node, respectively, and ξ ∈ [0, 1] , α ∈ [−π , π ]. The beam forming scenario in this section is shown in Fig. 10. Among them, the wavelength of the sending signal is , and the six cooperative nodes are distributed in the circle domain with a radius of 4 , and one cooperative node is located in the center of the circle domain. Each algorithm is optimized as the objective function shown in formula (17). For comparative fairness, in this experiment, the problem scale N is 50, the maximum function evaluation times Max_FEs = 5000*D, the number of nodes k = 6, the polar radius r k = 4, and other parameters are shown in Table 4. In order to avoid the adverse effects of contingency on the algorithm evaluation, each algorithm runs independently for 10 times, and selects the best collaborative beam optimization scheme corresponding to the PSL median of each algorithm is compared. Figure 11 intuitively gives the beam diagram of IGTOA and each comparison algorithm in the right Angle coordinate system, and then the PSL corresponding to each algorithm is annotated in the graph.
As can be seen from Fig. 11, for the above collaborative beam forming scenario, the best PSL obtained from GTOA, IATTP, MSMPSO, ADN-RSN-PSO, ESCA, and IGTOA are: − 3.7667 dB, − 4.0599 dB, − 3.1973 dB, − 4.0369 dB, − 3.7997 dB and − 4.3917 dB, respectively. Each algorithm achieved better cooperative beam optimization than unoptimized (− 1.8909 dB), and IGTOA achieved the best synergistic beam optimization than the other five algorithms. In Conclusion Section, the proposed IGTOA also has excellent performance in engineering applications.

Conclusion
This paper proposes an improved algorithm-IGTOA, which assigns teachers by probability and introduces different excellent genes in the group to ensure the population diversity; at the same time, the adaptive grouping method, combined with the different learning abilities of students in the two groups, put forward suitable search methods and learning methods, balancing the diversity loss rate in the evolution process and the algorithm convergence rate; in addition, this paper proposes a population reconstruction mechanism that starts with whether the population optimal individual has continuous changes and provides new genes for the population while maintaining excellent genes, which ensures the convergence rate of the algorithm and better maintains the population diversity; finally, simulation results from multiple experiments of this algorithm in the CEC2013 test suite show that IGTOA has good comprehensive performance, and IGTOA has obvious advantages in convergence speed and solution accuracy compared with many other comparative algorithms.

Data availability
The datasets used or analysed during the current study available from the corresponding author on reasonable request.